A-SFS: Semi-supervised feature selection based on multi-task self-supervision
نویسندگان
چکیده
Feature selection is an important process in machine learning. It builds interpretable and robust model by selecting the features that contribute most to prediction target. However, mature feature algorithms, including supervised semi-supervised, fail fully exploit complex potential structure between features. We believe these structures are very for process, especially when labels lacking data noisy. To this end, we innovatively introduces a deep learning-based self-supervised mechanism into problems, namely batch-Attention-based Self-supervision Selection(A-SFS). Firstly, multi-task autoencoder designed uncover hidden structural among with support of two pretext tasks. Guided integrated information from multi-self-supervised learning model, batch-attention generate weights according batch-based patterns alleviate impacts introduced handful noisy data. This method compared 14 major strong benchmarks, LightGBM XGBoost. Experimental results show A-SFS achieves highest accuracy datasets. Furthermore, design significantly reduces reliance on labels, only 1/10 labeled needed achieve same performance as those state art baselines. Results also missing
منابع مشابه
Feature Selection based Semi-Supervised Subspace Clustering
Clustering is the process which is used to assign a set of n objects into clusters(groups). Dimensionality reduction techniques help in increasing the accuracy of clustering results by removing redundant and irrelevant dimensions. But, in most of the situations, objects can be related in different ways in different subsets of the dimensions. Dimensionality reduction tends to get rid of such rel...
متن کاملA Convex Formulation for Semi-Supervised Multi-Label Feature Selection
Explosive growth of multimedia data has brought challenge of how to efficiently browse, retrieve and organize these data. Under this circumstance, different approaches have been proposed to facilitate multimedia analysis. Several semi-supervised feature selection algorithms have been proposed to exploit both labeled and unlabeled data. However, they are implemented based on graphs, such that th...
متن کاملMulti-Objective Semi-Supervised Feature Selection and Model Selection Based on Pearson's Correlation Coefficient
This paper presents a Semi-Supervised Feature Selection Method based on a univariate relevance measure applied to a multiobjective approach of the problem. Along the process of decision of the optimal solution within Pareto-optimal set, atempting to maximize the relevance indexes of each feature, it is possible to determine a minimum set of relevant features and, at the same time, to determine ...
متن کاملInfomation based supervised and semi-supervised feature selection
We merge the results from both of supervised and semi-supervised feature selection techniques. The method was applied to the five datasets from NIPS feature selection competition. As a preprocessing step, we firstly discretize each training dataset using EM algorithm. Then, we filter the discretized dataset based on the MI (mutual information) value of each feature with respect to the class var...
متن کاملForward Semi-supervised Feature Selection
Traditionally, feature selection methods work directly on labeled examples. However, the availability of labeled examples cannot be taken for granted for many real world applications, such as medical diagnosis, forensic science, fraud detection, etc, where labeled examples are hard to find. This practical problem calls the need for “semi-supervised feature selection” to choose the optimal set o...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Knowledge Based Systems
سال: 2022
ISSN: ['1872-7409', '0950-7051']
DOI: https://doi.org/10.1016/j.knosys.2022.109449